73 research outputs found

    Modelling BMI trajectories in children for genetic association studies

    Get PDF
    Background: The timing of associations between common genetic variants and changes in growth patterns over childhood may provide insight into the development of obesity in later life. To address this question, it is important to define appropriate statistical models to allow for the detection of genetic effects influencing longitudinal childhood growth. Methods and Results: Children from The Western Australian Pregnancy Cohort (Raine; n = 1,506) Study were genotyped at 17 genetic loci shown to be associated with childhood obesity (FTO, MC4R, TMEM18, GNPDA2, KCTD15, NEGR1, BDNF, ETV5, SEC16B, LYPLAL1, TFAP2B, MTCH2, BCDIN3D, NRXN3, SH2B1, MRSA) and an obesity-risk-allele-score was calculated as the total number of ‘risk alleles’ possessed by each individual. To determine the statistical method that fits these data and has the ability to detect genetic differences in BMI growth profile, four methods were investigated: linear mixed effects model, linear mixed effects model with skew-t random errors, semi-parametric linear mixed models and a non-linear mixed effects model. Of the four methods, the semi-parametric linear mixed model method was the most efficient for modelling childhood growth to detect modest genetic effects in this cohort. Using this method, three of the 17 loci were significantly associated with BMI intercept or trajectory in females and four in males. Additionally, the obesity-risk-allele score was associated with increased average BMI (female: β = 0.0049, P = 0.0181; male: β = 0.0071, P = 0.0001) and rate of growth (female: β = 0.0012, P = 0.0006; male: β = 0.0008, P = 0.0068) throughout childhood. Conclusions: Using statistical models appropriate to detect genetic variants, variations in adult obesity genes were associated with childhood growth. There were also differences between males and females. This study provides evidence of genetic effects that may identify individuals early in life that are more likely to rapidly increase their BMI through childhood, which provides some insight into the biology of childhood growth.Nicole M. Warrington, Yan Yan Wu, Craig E. Pennell, Julie A. Marsh, Lawrence J. Beilin, Lyle J. Palmer, Stephen J. Lye, Laurent Briollai

    Fat mass and obesity-associated obesity-risk genotype is associated with lower foetal growth: An effect that is reversed in the offspring of smoking mothers

    Get PDF
    Fat mass and obesity-associated (FTO) gene variants are associated with childhood and adult obesity; however, the influence of FTO polymorphisms on foetal growth is unknown. Associations between the FTO variant rs9939609 and the foetal growth trajectories, maternal pregnancy weight gain, anthropometric measures at birth and body mass index (BMI) at age 14 years were assessed in 1079 singleton-birth Australian Caucasians. Analyses were repeated in 3512 singleton-birth Dutch Caucasians. The rs9939609 obesity-risk AA genotype was associated with symmetrical intrauterine growth restriction; an effect reversed in mothers who smoked during pregnancy. The effect increased over time and was modified by maternal smoking for head circumference (P = 0.007), abdominal circumference (P = 0.007), femur length (P = 0.02) and estimated foetal weight (P = 0.001). The modification of the association between the AA genotype and birth anthropometrics by maternal smoking was consistent across birth weight (P = 0.01) and birth length (P = 0.04) and neonatal day 2 anthropometry. Consistent associations were replicated in the Generation R cohort. Maternal pregnancy weight gain matched the pattern of birth weight and was independent of placental weight. In adolescents, the AA genotype was associated with increased BMI-adjusted-for-age in males (P = 0.00009), but no effect was detected in females. A variant in the FTO gene influences foetal growth trajectories in the third trimester, early postnatal growth and adiposity in adolescence. Maternal smoking during pregnancy reversed the direction of association of rs9939609 on foetal growth, which was probably mediated by maternal energy intake. The detection of genetic variants associated with foetal growth has the potential to identify novel molecular mechanisms underlying growth and targeted early life intervention. Copyrigh

    Discovering joint associations between disease and gene pairs with a novel similarity test

    Get PDF
    Genes in a functional pathway can have complex interactions. A gene might activate or suppress another gene, so it is of interest to test joint associations of gene pairs. To simultaneously detect the joint association between disease and two genes (or two chromosomal regions), we propose a new test with the use of genomic similarities. Our test is designed to detect epistasis in the absence of main effects, main effects in the absence of epistasis, or the presence of both main effects and epistasis. Results: The simulation results show that our similarity test with the matching measure is more powerful than the Pearson's chi(2) test when the disease mutants were introduced at common haplotypes, but is less powerful when the disease mutants were introduced at rare haplotypes. Our similarity tests with the counting measures are more sensitive to marker informativity and linkage disequilibrium patterns, and thus are often inferior to the similarity test with the matching measure and the Pearson 's chi(2) test. Conclusions: In detecting joint associations between disease and gene pairs, our similarity test is a complementary method to the Pearson's chi(2) test

    Multilevel analysis of systolic blood pressure and ACE gene I/D polymorphism in 438 Swedish families – a public health perspective

    Get PDF
    BACKGROUND: Individuals belonging to the same family share a number of genetic as well as environmental circumstances that may condition a common SBP level. Among the genetic factors, the angiotensin converting enzyme (ACE) gene I/D polymorphism appears as a possible candidate as it might influence both SBP and the pharmacological effect of ACE inhibitors. We aimed to combine genetic epidemiology with public health ideas concerning life-course and multilevel epidemiology in order to understand the role of familial factors regarding individual SBP. METHODS: We applied multilevel regression analysis on 1926 individuals nested within 438 families from South Sweden. Modelling familial SBP variance as a function of age and use of ACE inhibitors we calculates a variance partition coefficient and the proportional change in familial SBP variance attributable to differences in ACE gene I/D polymorphism RESULTS: Our results suggest the existence of genetic or environmental circumstances that produce a considerable familial clustering of SBP, especially among individuals using ACE-inhibitors. However, ACE gene I/D polymorphism seems to play a minor role in this context. In addition, familial factors – genetic, environmental or their interaction – shape SBP among non-users of ACE inhibitors but their effect is expressed later in the life-course. CONCLUSION: Strategies directed to prevent hypertension should be launched in younger rather than in older ages and both prevention of hypertension and its treatment with ACE inhibitors should be focused on families rather than on individuals

    Comparison of Pathway Analysis Approaches Using Lung Cancer GWAS Data Sets

    Get PDF
    Pathway analysis has been proposed as a complement to single SNP analyses in GWAS. This study compared pathway analysis methods using two lung cancer GWAS data sets based on four studies: one a combined data set from Central Europe and Toronto (CETO); the other a combined data set from Germany and MD Anderson (GRMD). We searched the literature for pathway analysis methods that were widely used, representative of other methods, and had available software for performing analysis. We selected the programs EASE, which uses a modified Fishers Exact calculation to test for pathway associations, GenGen (a version of Gene Set Enrichment Analysis (GSEA)), which uses a Kolmogorov-Smirnov-like running sum statistic as the test statistic, and SLAT, which uses a p-value combination approach. We also included a modified version of the SUMSTAT method (mSUMSTAT), which tests for association by averaging χ2 statistics from genotype association tests. There were nearly 18000 genes available for analysis, following mapping of more than 300,000 SNPs from each data set. These were mapped to 421 GO level 4 gene sets for pathway analysis. Among the methods designed to be robust to biases related to gene size and pathway SNP correlation (GenGen, mSUMSTAT and SLAT), the mSUMSTAT approach identified the most significant pathways (8 in CETO and 1 in GRMD). This included a highly plausible association for the acetylcholine receptor activity pathway in both CETO (FDR≤0.001) and GRMD (FDR = 0.009), although two strong association signals at a single gene cluster (CHRNA3-CHRNA5-CHRNB4) drive this result, complicating its interpretation. Few other replicated associations were found using any of these methods. Difficulty in replicating associations hindered our comparison, but results suggest mSUMSTAT has advantages over the other approaches, and may be a useful pathway analysis tool to use alongside other methods such as the commonly used GSEA (GenGen) approach

    A genetic ensemble approach for gene-gene interaction identification

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>It has now become clear that gene-gene interactions and gene-environment interactions are ubiquitous and fundamental mechanisms for the development of complex diseases. Though a considerable effort has been put into developing statistical models and algorithmic strategies for identifying such interactions, the accurate identification of those genetic interactions has been proven to be very challenging.</p> <p>Methods</p> <p>In this paper, we propose a new approach for identifying such gene-gene and gene-environment interactions underlying complex diseases. This is a hybrid algorithm and it combines genetic algorithm (GA) and an ensemble of classifiers (called genetic ensemble). Using this approach, the original problem of SNP interaction identification is converted into a data mining problem of combinatorial feature selection. By collecting various single nucleotide polymorphisms (SNP) subsets as well as environmental factors generated in multiple GA runs, patterns of gene-gene and gene-environment interactions can be extracted using a simple combinatorial ranking method. Also considered in this study is the idea of combining identification results obtained from multiple algorithms. A novel formula based on pairwise <it>double fault </it>is designed to quantify the degree of complementarity.</p> <p>Conclusions</p> <p>Our simulation study demonstrates that the proposed genetic ensemble algorithm has comparable identification power to Multifactor Dimensionality Reduction (MDR) and is slightly better than Polymorphism Interaction Analysis (PIA), which are the two most popular methods for gene-gene interaction identification. More importantly, the identification results generated by using our genetic ensemble algorithm are highly complementary to those obtained by PIA and MDR. Experimental results from our simulation studies and real world data application also confirm the effectiveness of the proposed genetic ensemble algorithm, as well as the potential benefits of combining identification results from different algorithms.</p

    Discovery of Novel Hypermethylated Genes in Prostate Cancer Using Genomic CpG Island Microarrays

    Get PDF
    BACKGROUND: Promoter and 5' end methylation regulation of tumour suppressor genes is a common feature of many cancers. Such occurrences often lead to the silencing of these key genes and thus they may contribute to the development of cancer, including prostate cancer. METHODOLOGY/PRINCIPAL FINDINGS: In order to identify methylation changes in prostate cancer, we performed a genome-wide analysis of DNA methylation using Agilent human CpG island arrays. Using computational and gene-specific validation approaches we have identified a large number of potential epigenetic biomarkers of prostate cancer. Further validation of candidate genes on a separate cohort of low and high grade prostate cancers by quantitative MethyLight analysis has allowed us to confirm DNA hypermethylation of HOXD3 and BMP7, two genes that may play a role in the development of high grade tumours. We also show that promoter hypermethylation is responsible for downregulated expression of these genes in the DU-145 PCa cell line. CONCLUSIONS/SIGNIFICANCE: This study identifies novel epigenetic biomarkers of prostate cancer and prostate cancer progression, and provides a global assessment of DNA methylation in prostate cancer

    A multifactorial analysis of obesity as CVD risk factor: Use of neural network based methods in a nutrigenetics context

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Obesity is a multifactorial trait, which comprises an independent risk factor for cardiovascular disease (CVD). The aim of the current work is to study the complex etiology beneath obesity and identify genetic variations and/or factors related to nutrition that contribute to its variability. To this end, a set of more than 2300 white subjects who participated in a nutrigenetics study was used. For each subject a total of 63 factors describing genetic variants related to CVD (24 in total), gender, and nutrition (38 in total), e.g. average daily intake in calories and cholesterol, were measured. Each subject was categorized according to body mass index (BMI) as normal (BMI ≤ 25) or overweight (BMI > 25). Two artificial neural network (ANN) based methods were designed and used towards the analysis of the available data. These corresponded to i) a multi-layer feed-forward ANN combined with a parameter decreasing method (PDM-ANN), and ii) a multi-layer feed-forward ANN trained by a hybrid method (GA-ANN) which combines genetic algorithms and the popular back-propagation training algorithm.</p> <p>Results</p> <p>PDM-ANN and GA-ANN were comparatively assessed in terms of their ability to identify the most important factors among the initial 63 variables describing genetic variations, nutrition and gender, able to classify a subject into one of the BMI related classes: normal and overweight. The methods were designed and evaluated using appropriate training and testing sets provided by 3-fold Cross Validation (3-CV) resampling. Classification accuracy, sensitivity, specificity and area under receiver operating characteristics curve were utilized to evaluate the resulted predictive ANN models. The most parsimonious set of factors was obtained by the GA-ANN method and included gender, six genetic variations and 18 nutrition-related variables. The corresponding predictive model was characterized by a mean accuracy equal of 61.46% in the 3-CV testing sets.</p> <p>Conclusions</p> <p>The ANN based methods revealed factors that interactively contribute to obesity trait and provided predictive models with a promising generalization ability. In general, results showed that ANNs and their hybrids can provide useful tools for the study of complex traits in the context of nutrigenetics.</p

    Neural networks for modeling gene-gene interactions in association studies

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Our aim is to investigate the ability of neural networks to model different two-locus disease models. We conduct a simulation study to compare neural networks with two standard methods, namely logistic regression models and multifactor dimensionality reduction. One hundred data sets are generated for each of six two-locus disease models, which are considered in a low and in a high risk scenario. Two models represent independence, one is a multiplicative model, and three models are epistatic. For each data set, six neural networks (with up to five hidden neurons) and five logistic regression models (the null model, three main effect models, and the full model) with two different codings for the genotype information are fitted. Additionally, the multifactor dimensionality reduction approach is applied.</p> <p>Results</p> <p>The results show that neural networks are more successful in modeling the structure of the underlying disease model than logistic regression models in most of the investigated situations. In our simulation study, neither logistic regression nor multifactor dimensionality reduction are able to correctly identify biological interaction.</p> <p>Conclusions</p> <p>Neural networks are a promising tool to handle complex data situations. However, further research is necessary concerning the interpretation of their parameters.</p

    A Whole-Genome SNP Association Study of NCI60 Cell Line Panel Indicates a Role of Ca2+ Signaling in Selenium Resistance

    Get PDF
    Epidemiological studies have suggested an association between selenium intake and protection from a variety of cancer. Considering this clinical importance of selenium, we aimed to identify the genes associated with resistance to selenium treatment. We have applied a previous methodology developed by our group, which is based on the genetic and pharmacological data publicly available for the NCI60 cancer cell line panel. In short, we have categorized the NCI60 cell lines as selenium resistant and sensitive based on their growth inhibition (GI50) data. Then, we have utilized the Affymetrix 125K SNP chip data available and carried out a genome-wide case-control association study for the selenium sensitive and resistant NCI60 cell lines. Our results showed statistically significant association of four SNPs in 5q33–34, 10q11.2, 10q22.3 and 14q13.1 with selenium resistance. These SNPs were located in introns of the genes encoding for a kinase-scaffolding protein (AKAP6), a membrane protein (SGCD), a channel protein (KCNMA1), and a protein kinase (PRKG1). The knock-down of KCNMA1 by siRNA showed increased sensitivity to selenium in both LNCaP and PC3 cell lines. Furthermore, SNP-SNP interaction (epistasis) analysis indicated the interactions of the SNPs in AKAP6 with SGCD as well as SNPs in AKAP6 with KCNMA1 with each other, assuming additive genetic model. These genes were also all involved in the Ca2+ signaling, which has a direct role in induction of apoptosis and induction of apoptosis in tumor cells is consistent with the chemopreventive action of selenium. Once our findings are further validated, this knowledge can be translated into clinics where individuals who can benefit from the chemopreventive characteristics of the selenium supplementation will be easily identified using a simple DNA analysis
    corecore